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RELATED APPLICATIONS 

Reference is hereby made to the following co-pending, commonly assigned, U.S. 
patent applications: Serial Number 08/719,163, entitled, "INTERACTIVE INFORMATION 
TRANSACTION PROCESSING SYSTEM WITH UNIVERSAL TELEPHONY 
GATEWAY CAPABILITIES," the disclosure of which is incorporated herein by reference. 

TECHNICAL FIELD 

This invention relates to interactive voice response systems and, more particularly, to 
a voice response system that has the capability of being extended for local execution on a 
telephone unit or communication device. 
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BACKGROUND 

Interactive voice response systems (IVRs) are well-known and well-used by 
corporations and governmental entities alike. In many popular applications, IVRs allow such 
entities to handle numerous incoming calls from consumers, employees, or constituents 
without requiring a prohibitively expensive number of phone operators. Companies may 
5 typically conserve operator resources for use with only the most complex tasks by off-loading 
simple informational tasks to the automated IVRs. 

For example, the banking industry has made great use of IVR technology to conserve 
and reduce the expense of operator resources. Many bank customers call into a bank to find 
out simple items of information, such as account balance and last recorded deposits or 

10 withdrawals. This type of information is easily retrieved using the bank's database. An IVR 
may generally be programmed to answer the customer's phone call, determine the complexity 
of the information desired by the customer and then either present such simple retrieved 
information as account balance or account activity to the user using synthesized or pre- 
recorded voice messages, or transfer the customer to a live operator to handle the more 

15 complex tasks. This automation of the simple tasks relieves the operators from inefficient 
application and conserves their resource for the more complex tasks. 

In general, modern IVRs were developed during the evolution and advancement of 
telecommunication networks and equipment. In early networks, most calling service 
functionality was built into telephone switches. However, because of the enormous expense 

20 of telephone switches, advances in calling services were typically delayed until the scheduled 
addition of new switches and/or replacement or upgrade of existing switches. Service 
providers typically had to wait until the equipment manufacturers determined the appropriate 
time to add such calling service functionality to the switches. This limitation generally 
prevented individual service providers from offering competitive or innovative features 

25 without most other providers offering the same switch-resident features. 
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As switching technology became more and more computerized, computers were 
connected to the switches and given portions of telephone services to perform in conjunction 
with the switches. The extension of telephone network functionality to these peripheral 
computers created an intelligent network (IN) allowing many of the functions previously 
5 executed by the switches to be performed by the peripheral computers. As more of the 
telephone services and functionality was extended to the peripheral computers, the new 
network architecture was renamed advanced intelligent network (AIN) architecture. The 
development of AIN architecture generally increased the availability of calling services, such 
as call waiting, call forwarding, and even interactive voice response. The services, thus, 
10 began to move from large and expensive telecommunication switches, to the new telephone 
system integrated computers now designated service control points (SCPs), intelligent 
peripherals (IPs), and service nodes (SNs). 

SCPs and IPs are basically different hierarchical layers of the peripheral computers 
that were connected and integrated with the switching network. SCPs and IPs typically have 

15 call switching functionality, but also have the processing power to handle user voice and data 
input and make decisions based on this user input. Switches generally route calls to SCPs, 
which use IPs to perform many of the simple tasks, such as voice prompting or digit 
collection. In contrast, SNs are self-contained service providers that typically operate 
autonomously. A switch routes a call to an SN for services such as voicemail or fax server, 

20 which the SN performs without further delegation or input from the switch. Thus, much of 
the calling service functionality has been extended to peripheral computer/servers external to 
the switches in the AIN architecture. 

Most IVRs are connected into the public switched telephone network (PSTN) in order 
to facilitate their call handling functions. With the increase in asynchronous communication 
25 facilities, such as the internet protocol (IP) network, IVRs will need to include the capability 
of providing voice response services to such asynchronous communication formats as voice 
over IP (VoIP), One such IVR system is disclosed in the aforementioned co-pending 
application entitled, "INTERACTIVE INFORMATION TRANSACTION PROCESSING 
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SYSTEM WITH UNIVERSAL TELEPHONY GATEWAY CAPABILITIES." However, 
because VoIP and other asynchronous telecommunication formats are not yet widespread, the 
majority of IVR applications are still overwhelmingly synchronous and connect to the PSTN. 

IVR capacity is typically limited by the number of ports installed in the system. The 
5 ports connect the IVR to the PSTN. For example, an IVR manufactured with only one port, 
may only be able to handle one phone call at a time. Because telephone calls on the PSTN 
are typically circuit-switched, synchronous connections, an entire circuit path is reserved for 
the phone connection for the duration of the call. Even when nothing is being transmitted 
between callers, the circuit generally remains open and, thus, a connected IVR port will 
10 usually remain busy for the duration of the call or, at least, the duration of the voice 

application executed by the IVR. Once the voice response application has been completed, 
with the phone call either handled or forwarded to the appropriate employee or agent, the IVR 
may be able to answer the next call as soon as the port is made available. Therefore, IVRs are 
generally manufactured and customized according to the buyer's expected call traffic. 

15 Typically, IVRs are more expensive with more ports added to the system. Large 

capacity systems are, thus, usually more easily afforded by larger companies. However, even 
large companies may not generally be willing to spend a very large sum of money for an IVR 
system with the total number of ports required to handle the company's expected peek call 
volume. Thus, IVRs are typically purchased considering only average call volumes. While 

20 many consumer's calls will be answered by these IVRs, there will still be occasions when the 
consumer's call is placed on hold for a long time, or, even worse, the call is not answered at 
all. 

Another disadvantage of the necessity of holding ports and circuits open is the 
expense generally connected with the open telephone circuit, especially when a wireless 
25 telephone is connected to a traditional IVR application. While the cost of wireless 

communication is falling, users still typically pay to connect to the wireless network, even for 
calls to the user. 
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In consideration of the limitations of the current technology in IVRs, it would be 
desirable to have an IVR application that did not have a capacity limited by the number of 
ports installed on the IVR system. 

It would be a further advantage to have an IVR system that was not required to 
maintain a circuit connection between the calls and the calling service. 
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SUMMARY OF THE INVENTION 

The present invention is directed to a system and method for an extensible interactive 
voice response application comprising an application server having application logic and 
information stored thereon. The application logic is used for defining at least one voice 
response application resident on the server. The system also works in conjunction with 
5 communication devices for establishing connections with the application server. According 
to a preferred embodiment of the present invention, when a communication device initiates 
its connection with the application server, the server downloads application logic, or portions 
thereof, to the communication device to facilitate operation of IVR functions on the 
communication device. The preferred embodiment application logic may define operations 

10 including voice play and/or record, text-to-speech, voice recognition, dual tone multiple 
frequency (DTMF) input, and/or display multimedia output. Accordingly, the 
communication device should preferably include memory and a processor capable of 
executing the downloaded application logic and locally administering the particular voice 
response application. It should be noted, of course, that application logic of the present 

15 invention may select or enable particular operations based on the feature set of the particular 
communication device. Similarly, the application logic may be provided to the 
communication device in modules or executable portions to accommodate limitations of the 
communication device's resources and/or to provide efficient operation. 

As the communication device executes the voice response application, it may play 
20 audible voice cues according to the particular application. A user may enter responses to the 
voice cues by speaking or entering information using DTMF or other data input format. 
Depending on the users responses and requests, the communication device may speak or 
display responsive information to the user. The responsive information may preferably have 
been downloaded along with the application logic or, after the communication device re- 
25 establishes a connection, may be retrieved from the application server through internal or 
external sources. 
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Interaction of the caller and the application logic may provide functionality such as 
that of a more typical voice response unit (VRU), i.e., a caller may retrieve banking 
information without requiring the service provider to dedicate ports to the particular callers. 
Moreover, the interaction of the caller and the application logic may provide heretofore 
5 unavailable functionality. For example, application logic of a preferred embodiment of the 
present invention may define an Internet voice browsing session allowing a caller to freely 
access any information available over the Internet from a communication device such as a 
mobile or landline telephone. 

The implementation of a preferred embodiment of the present invention provides 
10 several potential benefits over existing IVR systems. Speech recognition and voice 
processing executed on the communication device in the preferred embodiment will 
preferably provide more accurate results due to the voice input being provided at the source 
of the processing. Furthermore, with the local processing, delay time is preferably reduced. 
The extensible IVR system is advantageously more scalable than traditional IVR systems 
15 because the extensible system does not rely on IVR ports to control, and limit, capacity. 
Moreover, the extensible IVR system leverages the existing IP infrastructure, which 
continues incredible growth due to the recent explosion in Internet accessibility demand. 

In addition, the remote communication devices may remain in communication with 
the application server without tying up an expensive port, or otherwise precluding another 

20 communication device from accessing application server. Thus, processing and system 
interaction may be going on with several communication devices at the same time. For 
example, a mobile phone may remain in communication with the server through a suspended 
socket connection to implement the voice response application. At the same time, one or 
more other mobile or landline phones (or any other compatible communication device) may 

25 preferably be downloading or interacting with the server through other asynchronous, 

multiplexed socket connections on the same line. At any required time, the mobile phone 
with the suspended socket connection may re-activate the suspended connection to interact 
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further with the server without requiring the suspension or deactivation of the other 
connections. 

The foregoing has outlined rather broadly the features and technical advantages of the 
present invention in order that the detailed description of the invention that follows may be 
5 better understood. Additional features and advantages of the invention will be described 

hereinafter which form the subject of the claims of the invention. It should be appreciated by 
those skilled in the art that the conception and specific embodiment disclosed may be readily 
utilized as a basis for modifying or designing other structures for carrying out the same 
purposes of the present invention. It should also be realized by those skilled in the art that 

10 such equivalent constructions do not depart from the spirit and scope of the invention as set 
forth in the appended claims. The novel features which are believed to be characteristic of 
the invention, both as to its organization and method of operation, together with further 
objects and advantages will be better understood from the following description when 
considered in connection with the accompanying figures. It is to be expressly understood, 

15 however, that each of the figures is provided for the purpose of illustration and description 
only and is not intended as a definition of the limits of the present invention. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

For a more complete understanding of the present invention, reference is now made to 
the following descriptions taken in conjunction with the accompanying drawing, in which: 

FIGURE 1 is a high-level block diagram illustrating a prior art IVR system; 

FIGURE 2 A is a high-level block diagram illustrating a preferred embodiment of the 
5 present invention; 

FIGURE 2B is a high-level block diagram illustrating a preferred embodiment of the 
present invention showing multiple XIVR. servers; 

FIGURE 3 is a high-level block diagram illustrating an alternative embodiment of the 
present invention; 

10 FIGURE 4 is a high-level block diagram illustrating the internal operation of an 

extensible IVR designed according to a preferred embodiment of the present invention; 

FIGURE 5 is a high-level block diagram illustrating the additional features of a 
communication device compatible with a system according to a preferred embodiment of the 
present invention; and 

!5 FIGURE 6 is a high-level block diagram illustrating an alternative embodiment of the 

present invention. 
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DETAILED DESCRIPTION 

FIGURE 1 illustrates a typical prior art IVR system. Customers usually call into IVR 
102 over PSTN 100 using traditional telephones 10 and 1 1, or standard mobile phones 17 and 
18 over wireless network 105 connected through PSTN 100. IVR 102 is typically connected 
to PSTN 100 using trunk 107, which generally comprises a collection of individual phone 
5 lines usually corresponding to the number of ports available on IVR 102. As customers call 
into the system, IVR 102 answers the calls and runs a voice response application for each 
caller. IVR 102 plays pre-recorded or synthesized voice messages and voice prompts, obtains 
input from the caller either through DTMF or spoken responses. Each voice message is 
played by IVR 102 and transmitted across PSTN 100 or PSTN 100 and wireless network 105 
10 to the caller at any of phones 1 1 and 12 or mobile phones 17 and 18. 

In the operation of the voice response application, IVR 102 will generally search 
database 104 for information responsive to requests made by the caller. In the example given 
above of the banking application, a caller could request to know the available balance of 
his/her account. IVR 102 would typically receive and interpret the request from the caller, 

15 and then search database 104 for the account balance using input from the caller, such as 

account number, password, etc. When IVR 102 finds the responsive data, it generally audibly 
communicates the account information to the caller. In order to communicate the responsive 
information to the caller, IVR 102 may either play pre-recorded voice messages or use 
synthesized text-to-speech technology. The technique used will usually be determined by the 

20 programming and capabilities of IVR 102. 

For complex interactions, IVR 102 generally forwards a caller to an agent or 
employee typically stationed at a computer-telephone integrated workstation (not shown). 
Once the caller has been forwarded to the agent or employee, the port into IVR 102, on which 
the caller interacted with the voice response application, will usually be released and 
25 available to take the next call. 

FIGURE 2A illustrates an extensible IVR (XIVR) system according to a preferred 
embodiment of the present invention. The concept of an extensible system is based on 
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running a system application, or portions thereof, on an external device. In the context of an 
IVR, an XIVR allows the voice response application to run locally on the accessing 
communication devices. In order for an XIVR system to operate, the accessing 
communication devices should preferably have, or have access to, basic processing 
5 capabilities and memory for storing the executable application logic. 

In the preferred embodiment of the present invention shown in FIGURE 2A, XIVR 
server 202 is connected to a data network, Internet 200, using data connections 201. 
"Callers" preferably access the system through an asynchronous data network, such as 
Internet 200. The access may be implemented in any manner available to access Internet 200. 

10 For wireless users, a "caller" may access XIVR server 202 using mobile phone 26, laptop 
computer 25, with wireless modem capabilities, and/or hand-held computer 27, also with 
wireless communication ability. The wireless users preferably connect to Internet 200 over 
wireless network 105, and then to XIVR server 202 over data connections 201. Users may 
also preferably access XIVR server 202 through direct connections to Internet 200 using 

15 computer 24, internet telephone 23, and/or hand-held computer 22; or may connect to Internet 
200 via PSTN 100 using compatible phones 20 and 21. 

Through its connection to Internet 200, XIVR server 202 may be addressed using an 
Internet protocol (IP) layer address or uniform resource locator (URL). As users access 
XIVR server 202, server 202 preferably downloads the voice response application logic, or 

20 portions thereof, to the user's communication device. Because the connection is made 
through Internet 200 and data connections 201, all data is advantageously transmitted 
asynchronously between XIVR server 202 and the user's communication device. This 
preferably allows more than one user to be connected to XIVR server 202 at the same time 
over a single line of data connections 201, thus preferably reducing the total number of data 

25 connections per user generally required for XIVR server 202. 

Once the application logic has been downloaded, the communication device 
preferably runs the application locally. Using mobile phone 26 as an example, XIVR server 
202 downloads the application logic, or portions thereof, to mobile phone 26. When the 



870224 



47524-P 1 25US- 1 0025004 



12 



PATENT 



application logic has been downloaded, the socket connection with XIVR server 202 is either 
closed or suspended pending any further interaction with XIVR server 202, Mobile phone 26 
then runs the application. Alternatively, mobile phone 26 may begin running the voice 
response application before the application logic, or portions thereof, is completely 
5 downloaded. Additionally or alternatively, initial portions of the voice response session may 
preferably come from XIVR server 202 directly, while the application is loading to mobile 
phone 26. Such features would preferably allow a more seamless interface with the user. 
Voice messages and prompts are preferably played for the user directly over mobile phone 
26's speaker. The user may respond as usual to the voice messages and prompts. 

10 In the preferred embodiment of the present invention, the user's responses will 

preferably be processed at mobile phone 26. Therefore, the user's response messages will not 
have to be transmitted back to XIVR server 202, subjecting the audio to signal degradation 
caused by the noise injection typical in such transmission. Alternatively, mobile phone 26 
may transmit all or some inputs from the user to XIVR server 202 for processing. For 

15 example, menu navigation responses may be processed locally, while the ultimate request for 
data is processed at XIVR server 202 or other coupled systems. 

For simple applications, the downloaded application logic may preferably include a 
table or small database of various optional answer messages for the voice application to use. 
For example, with a movie-times voice application, the different movie selections and times 

20 may be included in the application logic initially downloaded to mobile phone 26. Therefore, 
when a user requests the times for a particular movie at a particular theater, the voice 
application locally runs through a look-up table downloaded with the application logic to find 
and play the corresponding start times for the requested movie. Depending on the XIVR 
application or the memory considerations used when designing the system, the answer 

25 messages may comprise any combination of graphics, text, or aural information. The visual 
information, i.e., graphics and text, may preferably be presented on a communication device's 
display, while the aural information may preferably comprise digitized, pre-recorded voice 
files and/or data files used with text-to-speech synthesis within the application logic or within 
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the host communication device. The aural information may preferably be presented to the 
user on a resident speaker or other transducing mechanism. 

For more complex applications, or applications that handle sensitive data, such as 
financial information, mobile phone 26 would preferably establish additional, subsequent 
5 connections, or simply re-activate a "suspended" socket connection to XIVR server 202 in 
order to retrieve the requested information. Referring again to the banking example, after the 
user requests balance information, mobile phone 26 preferably processes the verbal response, 
determines the action requested, and then preferably accesses the socket connection to XIVR 
server 202 again. Because the next connection is to obtain further information corresponding 
10 to the downloaded voice response application, a code may preferably be added to the header 
of the data transmission indicating to XIVR server 202 that the following socket connection 
is a "continued" connection. This code advantageously prevents XIVR server 202 from 
attempting to download the initial application logic again. 

The data transmitted from mobile phone 26 would also preferably contain the 
15 processed request from the user to obtain the user's account balance. XIVR server 202 

preferably uses the request and other transmitted user input to find the account information in 
database 207. XIVR server 202 will then preferably package the responsive information and 
send it back to mobile phone 26 for presentation to the user. The packaged response may 
preferably be encrypted and may comprise an audio file for playback on mobile phone 26's 
20 audio player, a text file for use in a text-to-speech synthesis process at mobile phone 26, 

and/or text or graphics files for visual presentation on mobile phone 26 's display. Thus, for 
all requested information that is not downloaded along with the downloaded application 
logic, mobile phone 26 will re-establish the data connection with XIVR server 202 to obtain 
the appropriate responsive information. 

25 Extensible voice response applications preferably may also call other applications 

resident on XIVR server 202, or otherwise coupled thereto, as a part of the voice application 
processing. For example, a stock brokerage application may have several different and 
extensible functions available for a user to access. However, downloading the entire 
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application may be prohibitively time and resource consuming, for communication devices 
with limited memory and/or processing capabilities, such as mobile phone 26 and the like. 
Furthermore, not every user will want to execute all available functions. Therefore, it would 
be a more efficient use of memory and of the available bandwidth between the 
5 communication device and XIVR server 202 to only load portions of a complete voice 
response application. In such an application, the different functions may preferably be 
broken into different executable modules corresponding to available features. Thus, the first 
downloaded module may preferably include only the functions necessary to check balances 
and stock prices. It may also have options to buy and sell stocks. When a user selects the 
10 option to sell stocks, the downloaded application logic preferably causes mobile phone 26 to 
re-establish or unsuspend the socket connection with XIVR server 202 to download the "sell" 
module. The sell module will preferably replace the initial module in mobile phone 26 and 
execute its voice messaging and functionality in a similar manner. 

With regard to memory resources, it should be noted that a preferred embodiment of 
15 the present invention would advantageously manage the application logic stored on XIVR 

server 202 in accordance with the memory limitations of the particular communication device 
connecting to the system. XIVR server 202 would preferably include software to break down 
the application logic modules into appropriately-sized sub-modules suitable for running on 
the limited-memory devices, such as mobile phone 26 or hand-held computers 22 and 27. 
20 The system would preferably be able to read the type of connected device through the header 
data of the connection packets transmitted from the communication device. Upon connection 
and recognition of a limited-memory device, such as hand-held computer 27, XIVR server 
202 preferably downloads the first executable sub-module sized according to the memory 
limitations of hand-held computer 27. As the user completes execution of the first sub- 
25 module, hand-held computer 27 accesses the data connection socket with XIVR server 202 
and preferably downloads the next executable sub-module. This paging sequence would 
preferably continue until the application is ended, either by the user or the system. 
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In an alternative embodiment of the present invention, there may be a desire to 
connect a user with an agent to handle complex matters or if the user simply desires to speak 
with a live person. To handle the live agent or operator situation, the voice response 
application may include a script to establish a voice connection or voice call between the user 
5 and an agent. As the script is accessed, the communication device will connect with an agent 
using the information from within the downloaded application. 

In a further alternative embodiment of the present invention, XIVR server 202 may 
directly connect an agent to a user by incorporating the universal gateway capabilities of the 
aforementioned, co-pending, commonly assigned application entitled, "INTERACTIVE 

10 INFORMATION TRANSACTION PROCESSING SYSTEM WITH UNIVERSAL 

TELEPHONY GATEWAY CAPABILITIES." XIVR server 202 may preferably directly 
connect a user using either a synchronous or asynchronous voice-connection with an agent 
also using either a synchronous or asynchronous voice-connection. The capabilities 
described in the above-styled application allows for direct connection of the dissimilar 

15 connection types. 

In operation of an alternative embodiment of the present invention, a caller may begin 
an interactive voice response session with one XIVR and then hyperlink to another XIVR to 
execute or operate another voice application. FIGURE 2B illustrates the alternative 
embodiment of the present invention in which voice application hyperlinking may be used. 

20 For example, a caller using landline phone 21 may preferably access XIVR server 202 

over data links 201 to begin receiving the program code to operate a first voice response 
application. During the course of running the first application, the caller may preferably be 
presented a choice or given a hyperlink option to go to a second XIVR to run another voice 
response application. With reference to the banking example, the first application may give 

25 the user an option to open a brokerage account with a related brokerage company. On 

choosing this option, landline phone 21 preferably establishes a connection with XIVR server 
203 over Internet 200 using data connections 204. XIVR server 203 preferably downloads 
the application logic to landline phone 21 for running the brokerage account voice response 
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application. The caller would then preferably interact with the brokerage account application 
running on landline phone 21, which will then communicate the response data and any other 
necessary information for opening the brokerage account with XIVR server 203. Server 203 
will also preferably communicate with database 205 to store and retrieve information needed 
5 by landline phone 21 to further operate and complete the brokerage account application. 

FIGURE 3 illustrates an alternative embodiment of the present invention configured 
to initiate a connection with the inventive XIVR 202 using a voice/phone connection. 
Typical operation of prior art IVR systems begins with a user establishing a voice connection 
to the IVR. The present alternative embodiment uses a voice connection to initiate the XIVR 
10 system. The system is preferably accessed using any one of communication devices 20 - 27. 
To initiate the system, the communication device, e.g., mobile phone 26, places a voice call 
to XIVR server 202. The wireless connection is processed from wireless network 105 
through PSTN 100 to XIVR server 202 using trunk 300. Trunk 300 connects to traditional 
ports preferably included on XIVR server 202. 

15 In operation, mobile phone 26 preferably voice-connects to XIVR server 202, which 

initiates a preliminary voice response script. Preferably, through this initial script, all 
necessary information regarding the target address of mobile phone 26 is advantageously 
established. Such address information may be gathered either automatically, through calling 
data such as automatic number identification (ANI), dialed number identification service 

20 (DNIS), mobile identification number (MIN), or the equipment serial number (ESN), or 
manually through question and answer sequences with the user. This initial script may 
additionally or alternatively solicit information with respect to an application a caller wishes 
or needs to be implemented by XIVR server 202. It should also be noted that XIVR server 
202 may have a database of caller associated data. Such a database may contain caller 

25 specific information such as IP address or other data information used in establishing the data 
connection. XIVR server 202 may then use the calling data, such as the ANI, DNIS, MIN 
and/or ESN, to cross-reference the database for the appropriate connection address 
information. Once the address information has been determined, XIVR server 202 preferably 
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establishes a data socket connection and begins downloading the appropriate modules or sub- 
modules of the application logic to mobile phone 26 over Internet 200 and wireless network 
105 through data connections 201 . In order to minimize the delay, the application logic may 
preferably begin executing on mobile phone 26 prior to the completion of the initial 
5 download. This advantageously presents a more linear interface with the user. 

It should be noted that while the foregoing examples noted use of mobile phone 26 for 
the inventive system, the present invention is not limited to operation solely with mobile 
communication devices. Landline phones 20 and 21 may preferably access and execute the 
extensible voice response applications from XIVR server 202 using PSTN 100 and Internet 
10 200 networks to establish a data connection. Moreover, other communication devices such as 
hand-held computers and desktop or laptop computers may also be used with a preferred 
embodiment of the present invention. 

It should be noted that in the alternative embodiment shown in FIGURE 3, XIVR 
server 202 may preferably use simultaneous, or duplexed voice and data connections with 

15 communication devices 20-27. This would preferably allow XIVR server 202 to 

simultaneously, or nearly simultaneously, execute the preliminary voice response script while 
downloading the application logic to the communication device. For example, phone 21 
connecting to XIVR server 202 and Internet 200 using PSTN 100 may preferably maintain 
simultaneous voice and data connections with XIVR server 202 if the user subscribes to 

20 digital subscriber line (DSL) technology. Wireless communication devices 25 - 27 may 
preferably duplex between voice and data connections with XIVR server 202, under the 
current mobile communication systems, such as the time division-based global system for 
mobile communications (GSM) and the digital code division multiple access (CDMA) 
systems. In the near future, however, mobile systems will preferably support a simultaneous 

25 data and voice connection to XIVR server 202. The contemplated Third Generation (3G) 
systems which will utilize developing standards, such as wideband CDMA (WCDMA), and 
general packet radio service (GPRS), which will overlay a packet-switched network onto the 
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GSM and other time division based systems, each support simultaneous voice and data 
connections. 

FIGURE 4 shows the unique internal structures of an XIVR server of the preferred 
embodiment of the present invention. XIVR server 202 preferably extends traditional IVR 
5 functionality to each connecting communication device. In an alternative embodiment, XIVR 
server 202 also preferably provides traditional IVR functionality in order to facilitate the 
extensible application capability. Thus, XIVR server 202 comprises much of the same 
equipment found in traditional IVRs (equipment not shown). However, novel features of 
XIVR server 202 provide the ability to transport voice response functionality to external 
10 devices. 

Application logic storage 400 preferably comprises memory to store the executable 
voice response applications. As XIVR server 202 is contacted through PSTN trunk 300 or 
data connections 201, executable copies of the application logic are preferably downloaded 
over data connections 201 to the contacting communication units. The voice response 

15 applications are advantageously developed with development environment 401 through 

computer workstations 40 and/or 41. Unlike traditional IVRs, which may be programmed in 
proprietary languages generally requiring resident interpreters or compilers, the XIVR system 
preferably uses an extensible language, which is advantageously transferable to a host 
processor with the components used to run the given application. Languages such as 

20 hypertext markup language (HTML), extensible markup language (XML), VoiceXML, and 
the like may be utilized in providing the extensibility to program voice response applications 
for use with XIVR server 202. 

In an alternative embodiment described further below, XIVR server 202 may also 
preferably facilitate voice-browsing the Internet. In order to accomplish this function, XIVR 
25 server 202 also preferably comprises HTTP translator 402 (FIGURE 4). As XIVR server 202 
browses through the Internet, it reads the HTML web pages and advantageously converts the 
HTML into a compatible format for a voice response application, such as VoiceXML for 
example, and/or for interfacing with callers through various ones of communication devices 
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20 - 27. The HTTP is then preferably converted into the appropriate transport protocol and 
the web pages, or portions thereof, are downloaded to the connecting communication device. 
The translation executed by HTTP translator 402 preferably converts text-to-speech and 
notes hyperlinks as special voice cues to inform users of the executable links available. 
5 Additionally or alternatively, portions of the web site may be visually presented as text or 
graphics on a display associated with or connected to the communication device. These 
conversion components are advantageously included in the application logic downloaded to 
the connecting devices. 

It should be also noted that web sites or web pages may preferably be implement 
10 using a compatible extensible voice programming language, such as VoiceXML. Therefore 
the XIVR system may not always be required to perform translation of incompatible formats 
or protocols. 

FIGURE 5 illustrates a preferred embodiment communication device compatible with 
an XIVR system of the present invention. The device represented in FIGURE 5 is a mobile 

15 telephone. However, it should be noted that various types of communication devices may be 
adapted to operate with the inventive XIVR system. In addition to the typical equipment 
found in many communication devices, such as antenna, transducers, displays, etc., mobile 
phone 5 preferably includes processor 50, digital signal processor (DSP) 52, and memory 51. 
Traditional mobile phones already have processing units whether programmable, embedded, 

20 or application specific. Mobile phone 5 may operate successfully simply by changing the 

single processor of a traditional phone to a more powerful model. A powerful single process 
would generally be capable of performing the voice processing features as well as the 
standard phone features. However, in a preferred embodiment, mobile phone 5 comprises 
both additional processor 50 and DSP 52 in order to optimize the application execution. 

25 Memory 51 provides storage for the downloaded application logic, response data, 

reference tables, and/or any other form of data or information used in the execution of a voice 
response application. With the advances in memory technology, it is possible to place a large 
amount of memory on a small device with very little power consumption. In fact, the 



870224 



47524-P125US-10025004 



20 



PATENT 



memory may be configured with its own backup power through using an attached battery. In 
alternative embodiments, small devices, such as phones and hand-held computers may be 
configured to accept external memory cards such as floppy disks or compact flash-type cards. 
These small memory units could greatly expand the memory capability of a compatible 
5 phone. 

It should also be noted that alternative embodiments may be configured to accept 
external or internal additional processors or memory. In such embodiments, the memory and 
processor power of such communication devices may be selectively enhanced and/or 
upgraded. 

10 FIGURE 6 illustrates an alternative embodiment of the present invention configured 

to provide voice-browsing the Internet. Communication devices 20, 21, 23, and 26 are each 
phones that typically are not used to browse Internet 200. Mobile phone 26 may be WAP 
compatible, providing limited access to Internet 200. However, WAP is typically used only 
with wireless applications, or applications written specifically for WAP-compatible mobile 

15 phones, thus not providing full access to available web content. WAP is also browsed 
exclusively by using keypresses. 

In a preferable example of voice-browsing operation, on an alternative embodiment of 
the present invention, landline phone 20 preferably accesses XIVR server 202 over data 
connections 201 through PSTN 100 and Internet 200. XIVR server 202 may check database 

20 207 to determine whether phone 20 subscribes to the voice-browsing service. If phone 20 
subscribes to voice-browsing, the application logic is preferably downloaded and run on 
phone 20. The application preferably requests the user to speak/spell the URL of the website 
to access. Additionally or alternatively, a list of favorites may be stored in a user-accessible 
database disposed either on the communication device or on a database associated with XIVR 

25 server 202. If such a favorites list is disposed on a database associated with XIVR server 
202, server 202 would preferably download the list to the communication device at the 
initialization of each application session. Phone 20 preferably returns the URL to XIVR 
server 202, which accesses the website through data connections 201. XIVR server 202 
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preferably retrieves the HTML code for the website and translates it, if necessary, into a 
compatible language, such as XML or VoiceXML. XIVR server 202 then preferably 
downloads the web page to phone 20, which begins playing and/or displaying the text and 
hyperlinks from the accessed site. 

5 In one version of this preferred alternative embodiment, XIVR server 202 may begin 

translating each of the web pages corresponding to the available hyperlinks on the page 
downloaded to phone 20. This pre-translation will preferably increase the speed with which 
phone 20 receives the next web page, if the user chooses one of the hyperlinks. Alternatively, 
XIVR server 202 may preferably access available counter or statistical software resident on 
10 the target web pages to determine the hyperlinks most likely chosen by a user. The 

alternative predictive voice-browser will preferably make a statistical choice regarding the 
resources to expend pre-translating web pages. 

It should be noted that in the application of the present invention, the communication 
device may be configured to semi-permanently store the application logic and/or the 

15 information retrieved by the voice response application. Thus, a user would preferably not 
have to re-access the XIVR system to obtain the same data or information. Once a session 
has been completed, the communication device would have the information and/or the 
portion of the voice response application available for the user without requiring a renewed 
connection to the XIVR system. A user obtaining an account balance would therefore be able 

20 to recall the balance locally on the phone until such time as either the user or the 

communication device erases the corresponding memory. Such a data retention policy may 
be predefined by the communication device manufacturer, the XIVR system provider, and/or 
the user. Additionally or alternatively, the user may manually choose an option to erase or 
save such data or application logic. 

25 It should be noted that while the descriptions of the alternative embodiments of the 

present invention shown in FIGURES 2, 3, and 6 show data connections with XIVR 202 to 
the Internet, the present invention may also be connected through other data networks. XIVR 
202 may operate internally within a company through the company's intranet. Employees 
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would therefore be able to access the XIVR system at work or from mobile locations by 
calling into the company intranet. 

It should further be noted that the preferred embodiment of the present invention is 
not limited to connecting with mobile communication devices indirectly through either the 
5 Internet or the PSTN. As GPRS systems are installed in wireless networks, it will be possible 
to asynchronously connect directly to the XIVR system. An XIVR configured to directly 
communicate with a GPRS-enabled mobile device would also preferably incorporate a 
wireless communication interface equally compatible with the GPRS system. 

Although the present invention and its advantages have been described in detail, it 
10 should be understood that various changes, substitutions and alterations can be made herein 
without departing from the spirit and scope of the invention as defined by the appended 
claims. Moreover, the scope of the present application is not intended to be limited to the 
particular embodiments of the process, machine, manufacture, composition of matter, means, 
methods and steps described in the specification. As one of ordinary skill in the art will 
15 readily appreciate from the disclosure of the present invention, processes, machines, 

manufacture, compositions of matter, means, methods, or steps, presently existing or later to 
be developed that perform substantially the same function or achieve substantially the same 
result as the corresponding embodiments described herein may be utilized according to the 
present invention. Accordingly, the appended claims are intended to include within their 
20 scope such processes, machines, manufacture, compositions of matter, means, methods, or 
steps. 
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WHAT IS CLAIMED IS: 

1 . An interactive voice response system comprising: 

an application server having application logic and information stored thereon, said 
application logic for defining at least one voice response application; 

a communication device for establishing at least one connection with said application 
5 server, wherein said application server communicates said application logic to said 
communication device responsive to one of said established connections; and 

a processor connected to said communication device to execute said communicated 
application logic and locally administer said at least one voice response application. 

2. The system of claim 1 further comprising: 

a data network interface in communication with said application server for retrieving 
information responsive to said at least one voice response application locally administered at 
said communication device. 

3. The system of claim 2 wherein said data network is the Internet. 

4. The system of claim 1 wherein said communication device establishes at least 
one connection with another application server responsive to said information retrieved by 
said application server, said another application server having another application logic stored 
thereon defining at least one other voice response application for execution on said 

5 communication device. 
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5. The system of claim 1 further comprising: 

translation logic for converting said retrieved information and applications into a 
format compatible with said application logic. 

6. The system of claim 1 wherein said application server divides said at least one 
voice response application into one or more selectively-sized, executable sub-modules, 
wherein said size is selected responsive to memory limitations of said communication device. 

7. The system of claim 6 wherein said communication device obtains one of said 
one or more sub-modules for execution. 

8. The system of claim 7 wherein said communication device obtains a next one 
of said one or more sub-modules after completing execution of said one sub-module. 

9. The system of claim 1 further comprising: 

a user interface disposed on said communication device for accepting input from a 
user responsive to said at least one voice response application. 

10. The system of claim 9 wherein said at least one voice response application 
prompts said user to input information in one or more formats chosen from the group 
comprising: 

dual tone multiple frequency (DTMF); 

speech; and 

text. 



870224 



47524-P 1 25US- 1 0025 004 



25 



PATENT 



11. The system of claim 10 wherein said processor processes said user input 
locally according to said at least one voice response application. 



12. The system of claim 1 1 further comprising voice recognition logic. 

13. The system of claim 12 wherein said voice recognition logic is speaker 
dependent. 



14. The system of claim 12 wherein said voice recognition logic is speaker 
independent. 

15. The system of claim 12 wherein said voice recognition logic is disposed 
permanently on said communication device. 

16. The system of claim 15 wherein said voice recognition logic is downloaded to 
said device from said application server. 

17. The system of claim 12 wherein said voice recognition logic is disposed on 
said application server and wherein said voice recognition logic receives digital voice packets 
from said communication device. 
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18. The system of claim 9 further comprising: 

an audio transducer disposed on said communication device for playing aural 
segments to said user in accordance with the operation of said at least one voice response 
application; and 

a display disposed on said communication device for presenting visual information to 
said user in accordance with the operation of said at least one voice response application. 

19. The system of claim 18 wherein said aural segments comprise digitized voice 

files. 

20. The system of claim 18 wherein said aural segments comprise text messages 
converted to speech at said communication device. 

21 . The system of claim 1 8 wherein information responsive to said at least one 
voice response application is presented on said communication device according to a set of 
preferences preselected by said user. 

22. The system of claim 21 wherein a set of potentially responsive information is 
downloaded to said communication device with said application logic. 

23. The system of claim 22 wherein said responsive information is obtained from 
said application server when said set of potentially responsive information is not responsive 
to said at least one voice response application. 

24. The system of claim 23, wherein said responsive information obtained from 
said application server is located on one of said application server and said data network. 
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25. The system of claim 9 wherein said communication device uses a packet 
switching network to connect to said application server. 

26. The system of claim 9 wherein said communication device initiates said 
application server connection over a voice connection and receives said information and 
application logic over a data connection. 

27. The system of claim 26 wherein said voice connection comprises a circuit 
switched network and said data connection comprises a packet switched network. 

28. The system of claim 9 wherein said communication device communicates with 
said application server using a blended voice and data network. 

29. The system of claim 1 wherein said communication device is chosen from the 
group comprising: 

a mobile phone; 

a hand-held computer; 

a landline phone; 

a desktop computer; and 

a data network phone. 

30. The system of claim 1 wherein said application logic comprises VoiceXML. 
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31. A method for providing an interactive voice response application to a user on a 
communication unit comprising the steps of: 

establishing an initial connection between said communication unit and a multimedia 

server; 

5 transmitting software code defining said interactive voice response application to said 

communication unit; 

executing said software code on said communication unit to run said interactive voice 
response application; and 

providing information to said user responsive to requests made pursuant to said 
10 interactive voice response application. 

32. The method of claim 31 further comprising the step of: 
retrieving said information responsive to said requests. 

33. The method of claim 32 wherein said communication unit retrieves said 
responsive information from a set of information downloaded with said software code. 

34. The method of claim 33 wherein said communication unit retrieves said 
responsive information from said multimedia server. 

35. The method of claim 34 wherein said multimedia server obtains said 
responsive information from one of an internal database and a data network. 
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36. The method of claim 35 further comprising the step of: 

converting said responsive information into a format compatible with said interactive 
voice response application. 

37. The method of claim 31 wherein said transmitting step further comprises the 

step of: 

dividing said software code into selectively-sized segments responsive to a memory 
capacity of said communication unit. 

38. The method of claim 3 1 further comprising the step of: 

receiving input from said user responsive to voice messages played by said interactive 
voice response application. 

39. The method of claim 38 wherein said input is chosen for a group comprising: 
voice input; 

dual tone multiple frequency (DTMF); and 
text input. 

40. The method of claim 39 further comprising the step of processing said voice 

input. 

41 . The method of claim 40 wherein said voice processing is done by said 
communication device. 
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42. The method of claim 40 wherein said voice processing is done by said 
multimedia server. 

43. The method of claim 42 wherein said voice processing is speaker dependent. 

44. The method of claim 42 wherein said voice processing is speaker independent. 

45. The method of claim 34 further comprising the step of: 

reestablishing a subsequent connection between said communication unit and said 
multimedia server to retrieve said responsive information. 

46. The method of claim 37 further comprising the step of: 

downloading a next selectively-sized segment after execution of said transmitted 
segment. 

47. The method of claim 3 1 wherein said initial connection is implemented over a 
data network. 

48. The method of claim 31 wherein said initial connection is implemented over a 
voice network and said transmitting step is implemented over a data network. 
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49. The method of claim 3 1 further comprising the step of: 

establishing communication between said user and an operator responsive to a request 
made pursuant to said interactive voice response application. 

50. The method of claim 49 wherein said communication is established using a 
data network. 

5 1 . The method of claim 49 wherein said communication is established using a 
voice network. 

52. The method of claim 49 wherein said communication is established using a 
combination of a voice network and a data network. 
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53. A system for implementing an interactive voice response application on a 
communication device comprising: 

a central server in communication with a data network; 

extensible application code disposed on said central server, said code defining an 
5 interactive voice response application; 

memory disposed on said communication device for storing a copy of said extensible 
application code, wherein said communication device downloads said copy from said central 
server using said data network; and 

a processor disposed on said communication device for running said copy of said 
10 extensible application code and administering said interactive voice application substantially 
independent from said central server. 

54. The system of claim 53 wherein said interactive voice application provides 
information responsive to requests made in administering said interactive voice application. 

55. The system of claim 54 wherein said responsive information is provided from 
information downloaded with said copy of said extensible application code. 

56. The system of claim 54 wherein said communication device communicates 
with said central server to obtain said responsive information. 

57. The system of claim 56 wherein said central server retrieves said responsive 
information from an internal memory location. 

58. The system of claim 56 wherein said central server retrieves said responsive 
information from said data network. 
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59. The system of claim 58 further comprising: 

conversion code disposed on said central server to convert responsive information 
retrieved from said data network into a format compatible with said interactive voice 
application. 

60. The system of claim 53 further comprising voice processing logic to process 
input spoken by a user into said communication device. 

61 . The system of claim 60 wherein said voice processing logic is disposed on 
said communication device. 

62. The system of claim 61 wherein said communication device transmits said 
input to said central server using a data connection. 

63. The system of claim 60 wherein said voice processing logic is speaker 
dependent. 

64. The system of claim 60 wherein said voice processing logic is speaker 
independent. 
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65. The system of claim 53 further comprising: 

application management software disposed on said central server for dividing said 
extensible application code into selectively-sized sub-modules, wherein said selected size is 
determined from memory limitations of said communication device. 

66. The system of claim 65 wherein said communication device downloads a next 
sub-module after completing execution of a current sub-module. 

67. The system of claim 53 wherein said communication device initiates said 
download of said copy by communicating with said central server using a voice network. 

68. The system of claim 54 further comprising: 

a connection resource for connecting a user to an agent responsive to said requests 
made in administering said interactive voice application, wherein said connection allows live 
voice communication between said user and said agent. 

69. The system of claim 68 wherein said connection resource connects said user 
and said agent using one of said data network and a voice network. 

70. The system of claim 68 wherein said connection resource connects said user 
and said agent using a combination of said data network and a voice network. 
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71. A computer program product defining an interactive multimedia response 
application for use on a communication device, said computer program product comprising: 

at least one function for operation of said interactive multimedia response application 
corresponding to a predefined set of at least one desired application feature; 

5 a multimedia display driver for processing multimedia information for presentation to 

a user; 

application logic for providing multimedia information to said multimedia display 
driver for presenting user prompts according to operation of said at least one function; and 

multimedia input interface for processing multimedia input. 

72. The computer program product of claim 71 wherein said multimedia display 
driver comprises: 

an audio media player for presenting audio files to said user; and 

a graphical driver for presenting visual information on a display of said 
5 communication device. 

73. The computer program product of claim 72 wherein said multimedia display 
driver further comprises a speech synthesizer for converting text information to speech for 
presentation to said user. 
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74. The computer program product of claim 71 wherein said multimedia input 
interface comprises: 

a dual tone multiple frequency (DTMF) interface for accepting DTMF input signals 
from said user; 

a voice processor for receiving voice input from said user; and 
a data interface for receiving text input from said user. 

75. The computer program product of claim 74 wherein said voice processor 
comprises speaker dependent voice processing. 

76. The computer program product of claim 74 wherein said voice processor 
comprises speaker independent voice processing. 
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77. A method for providing an interactive voice response application to a user on a 
communication unit comprising the steps of: 

launching a connection between said communication unit and a multimedia server; 

downloading application code defining said interactive voice response application to 
said communication unit; and 

running said application code on said communication unit to execute said interactive 
voice response application so that said user can have a voice response interactive session 
controlled, at least in part, by said downloaded application code. 

78. The method of claim 77 further comprising the step of: 

retrieving said information responsive to said voice response interactive session. 

79. The method of claim 78 wherein said communication unit retrieves said 
responsive information from one of a database internal to said communication unit and a 
database external to said communication unit. 

80. The method of claim 77 wherein said connection between said communication 
unit and said multimedia server comprises a data socket connection. 
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81. A method for obtaining multimedia information on a communication device using 
a locally administered interactive voice application, said method comprising the steps of: 

actuating said communication device to initiate an interactive voice response session; 

receiving application logic into said communication device to locally administer said 
5 interactive voice response session; 

observing multimedia prompts on said communication device provided by said 
interactive voice response session; 

providing said interactive voice response session multimedia input responsive to said 
observed multimedia prompts, wherein said multimedia input is processed by said 
10 communication device; and 

observing multimedia information on said communication device provided by said 
interactive voice response session responsive to said processed multimedia input. 

82. The method of claim 81 wherein said multimedia prompts comprise one of aural 
segments presented over an audio transducing mechanism of said communication device and 
visual information presented using a visual display of said communication device. 

83. The method of claim 81 wherein said multimedia prompts comprise a 
combination of aural segments presented using an audio transducing mechanism and visual 
information presented using a visual display. 



870224 



47524-P 1 25US- 1 0025004 



39 



PATENT 



84. The method of claim 8 1 wherein said multimedia input is chosen from the group 
comprising: 

speech; 

dual tone multiple frequency (DTMF) signals; and 
text. 

85. The method of claim 81 wherein said multimedia information comprises one of 
aural segments presented over an audio transducing mechanism of said communication device 
and visual information presented using a visual display of said communication device. 

86. The method of claim 81 wherein said multimedia information comprises a 
combination of aural segments presented using an audio transducing mechanism and visual 
information presented using a visual display. 

87. The method of claim 81 further comprising the step of retrieving said multimedia 
information responsive to said multimedia input. 

88. The method of claim 86 wherein said multimedia information is retrieved from 
a database local to said communication device. 
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89. The method of claim 86 wherein said multimedia information is retrieved from 
a database external to said communication device when said information contained in said local 
database is not responsive to said multimedia input. 

90. The method of claim 81 further comprising the step of storing said multimedia 
information onto said communication device to provide access to said multimedia information 
substantially independent from said interactive voice response session. 

91 . The method of claim 8 1 further comprising the step of selectively forwarding 
said multimedia information to another communication device. 
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EXTENSIBLE INTERACTIVE VOICE RESPONSE 

ABSTRACT OF THE DISCLOSURE 

The present invention discloses a system and method for providing interactive voice 
response (IVR) applications executable on individual communication devices. Unlike current 
IVR applications that run from centralized voice servers, the present invention describes a 
system in which communication units initiate communication with a multimedia server over a 
5 data network such as the Internet and download extensible copies of voice response 

applications. The communication device then runs the voice response applications, thus, 
locally administering the voice messages and accepting the voice or data input from a user. 
The multimedia server may preferably divide the application software into executable 
segments to accommodate communication devices with limited memory resources, such as 

10 mobile phones and hand-held computers. The system and method may implement different 
level of complexity by breaking application functionality into modules and the sub-modules. 
For multi-module applications, the communication units will re-establish communication 
links with the multimedia server to download each necessary or subsequent module or sub- 
module. The system and method may also allow users to connect directly to agents or 

15 operators to perform tasks that are too complex for efficient automation. The system may 
connect users and agents using any combination of a data network and voice network. The 
implementation of the voice response application using the data network connection allows a 
reduction in the number of telephone ports into an IVR and also allows multiple users to 
access the IVR over the same line, because of the asynchronous nature of the data network. 
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PATENT 



COMBINED DECLARATION AND POWER OF ATTORNEY 

(ORIGINAL, DESIGN, NATIONAL STAGE OF PCT, SUPPLEMENTAL, DIVISIONAL, 
CONTINUATION, OR C-I-P) 

As a below named inventor, I hereby declare that: 

TYPE OF DECLARATION 

This declaration is of the following type: 

original, 
design, 
supplemental. 

national stage of PCT. 

divisional, 
continuation. 

continuation-in-part (C-I-P). 

INVENTORSHIP IDENTIFICATION 

My residence, post office address and citizenship are as stated below, next to my name. I believe that I 
am the original, first and sole inventor (if only one name is listed below) or an original, first and joint 
inventor (if plural names are listed below) of the subject matter that is claimed, and for which a patent 
is sought on the invention entitled: 

TITLE OF INVENTION 

Extensible Interactive Voice Response 
SPECIFICATION IDENTIFICATION 

The specification of which: 

(a) is attached hereto. 

(b) □ was filed on , as □ Serial No. 0 / 

C and was amended on 

(c) □ was described and claimed in PCT International Application No. _ 

on 

and as amended under PCT Article 19 on 

any). 



□ 
□ 

□ 

□ 
□ 
□ 



. or 

. (if applicable). 
filed 



(if 
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PATENT 



SUPPLEMENTAL DECLARATION (37 CFR 1.67(b)) 

I hereby declare that the subject matter of the 

d attached amendment 

D amendment filed on 



was part of my/our invention and was invented before the filing date of the original application, 
above identified, for such invention. 

ACKNOWLEDGMENT OF REVIEW OF PAPERS AND DUTY OF CANDOR 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information, which is material to patentability as defined in 
37, Code of Federal Regulations, § 1.56, 

□ in compliance with this duty, there is attached an information disclosure statement, in 
accordance with 37 CFR 1.98. 

PRIORITY CLAIM (35 U.S.C. § 119(a)-(d)) 

I hereby claim foreign priority benefits under Title 35, United States Code, § 1 19(a)-(d) of any 
foreign application(s) for patent or inventor's certificate or of any PCT international application(s) 
designating at least one country other than the United States of America listed below and have also 
identified below any foreign application(s) for patent or inventor's certificate or any PCT international 
application(s) designating at least one country other than the United States of America filed by me on the 
same subject matter having a filing date before that of the application(s) of which priority is claimed. 

(d) IS no such applications have been filed. 

(e) □ such applications have been filed as follows. 



PRIOR FOREIGN/PCT APPLICATION(S) FILED WITHIN 12 MONTHS 
(6 MONTHS FOR DESIGN) PRIOR TO THIS APPLICATION 
AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. § 119(a)-(d) 



Country 
(Or Indicate If 

PCT) 


Application 
Number 


Date of Filing 
Day, Month, Year 


Priority Claimed 
under 35 Use 119 








[ ] Yes 


[ ]No 








[ ] Yes 


[ ]No 








[ ] Yes 


[ ]No 
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PATENT 



CLAIM FOR BENEFIT OF PRIOR U.S. PROVISIONAL APPLICATION(S) 

(35U.S.C. § 119(e)) 

I hereby claim the benefit under Title 35, United States Code, § 119(e) of any United States 
provisional application(s) listed below: 

PROVISIONAL APPLICATION NUMBER FILING DATE 

/ 

/ 

/ 



CLAIM FOR BENEFIT OF EARLIER U.S./PCT APPLICATION(S) 
UNDER 35 U.S.C.§ 120 

□ I hereby claim the benefit under Title 3 5 , United States Code § 1 20 of any United States 
application(s) or § 365(b) of any PCT international application designating the United States of America, 
listed below and, insofar as the subject matter of each of the claims of this application is not disclosed 
in the prior U.S. or PCT international application in the manner provided by the first paragraph of Title 
35, U.S.C. § 1 12, 1 acknowledge the duty to disclose material information as defined in Title 37, Code 
of Federal Regulations § 1 .56(a) which occurred between the filing date of the prior application and the 
national or PCT international filing date of this application. 



Application Serial 


Filing Date 


Status 





















ALL FOREIGN APPLICATION(S), IF ANY, FILED MORE THAN 12 MONTHS 
(6 MONTHS FOR DESIGN) PRIOR TO THIS U.S. APPLICATION 



POWER OF ATTORNEY 

I hereby appoint the following practitioner(s) to prosecute this application and transact all 
business in the Patent and Trademark Office connected therewith: 

David H. Tannenbaum, Reg. No. 24,745; 
Michael A. Papalas, Reg. No. 40,381; 
R. Ross Viguet, Reg. No. 42,203; 
Michael J. Fogarty, III, Reg. No. 42,541; 
Jody Bishop, Reg. No. 44,034; 
Thomas J. Meaney, Reg. No. 41,990; 
Matthew Jones, Reg. No. 44,810 and 
William B. Tiffany, Reg. No. 41,347. 
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PATENT 



SEND CORRESPONDENCE TO 



DIRECT TELEPHONE CALLS TO: 



David H. Tannenbaum 

FULBRIGHT & JAWORSKI L.L.P. 

2200 Ross Avenue, Suite 2800 
Dallas, Texas 75201. 



R. Ross Viguet 
(214)855-8185 



DECLARATION 



I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements were 
made with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that such willful 
false statements may jeopardize the validity of the application or any patent issued thereon. 



NOTE: Carefully indicate the family (or last) name, as it should appear on the filing receipt and all other document. 

Full name of sole or first inventor: Dan Hammond 

Inventor's signature : 



SIGNATURE(S) 



Country of Citizenship: U.S.A. 

Residence: 17623 Cedar Creek Canyon, Dallas, Texas 75252 



Date: 



Post Office Address: 17623 Cedar Creek Canyon, Dallas, Texas 75252 
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